AD-A222  074 


Office  of  Naval  Research 
Contract  N00014-88-K-0472 
R  T  Code  413p008 


Technical Report No. 5


Molecular  Modeling  Studies  of  Cholic  Acid  Host: 
Shape and Dimensions. II.


by 

Suzanne  M.  Evans*  and  Carol  A.  Venanzi 


New  Jersey  Institute  of  Technology 
Chemistry  Division 
Newark,  NJ  07102 


May  21,  1990 


*:  The  BOC  Technical  Center,  Murray  Hill,  NJ 

Reproduction  in  whole  or  in  part  is  permitted  for  any  purpose  of  the  United  States 

Government. 


This document has been approved for public release and sale; its distribution is unlimited.




REPORT  DOCUMENTATION  PAGE 


Form Approved
OMB No. 0704-0188


1a  REPORT SECURITY CLASSIFICATION

Unclassified


2a  SECURITY CLASSIFICATION AUTHORITY


1b  RESTRICTIVE MARKINGS


3  DISTRIBUTION/AVAILABILITY OF REPORT


2b  DECLASSIFICATION / DOWNGRADING SCHEDULE


4  PERFORMING ORGANIZATION REPORT NUMBER(S)


5  MONITORING ORGANIZATION REPORT NUMBER(S)


6a  NAME OF PERFORMING ORGANIZATION

New Jersey Institute of
Technology

6c  ADDRESS (City, State, and ZIP Code)
Chemistry Division
University Heights
Newark, New Jersey 07102


8a  NAME OF FUNDING/SPONSORING
ORGANIZATION

Office of Naval Research


8c  ADDRESS (City, State, and ZIP Code)
800 N. Quincy Street
Arlington, VA 22217-5000


6b  OFFICE SYMBOL (If applicable)    7a  NAME OF MONITORING ORGANIZATION


7b  ADDRESS (City, State, and ZIP Code)


8b  OFFICE SYMBOL (If applicable)    9  PROCUREMENT INSTRUMENT IDENTIFICATION NUMBER

ONR


10  SOURCE  OF  FUNDING  NUMBERS 


PROJECT 

TASK 

NO 

NO 

WORK  UNIT 
ACCESSION  NO 


11. TITLE (Include Security Classification)

Molecular  Modeling  Studies  of  Cholic  Acid  Host:  Shape  and  Dimensions.  II. 


12.  PERSONAL  AUTHOR(S) 

Suzanne  M.  Evans  and  Carol  A.  Venanzi 


13a  TYPE OF REPORT


13b  TIME  COVERED 
FROM  TO 


14. DATE OF REPORT (Year, Month, Day)  15 PAGE COUNT

May  21,  1990  18 


COSATI  CODES 


GROUP  SUB-GROUP 


SUBJECT  TERMS  ( Continue  on  reverse  if  necessary  and  identify  by  block  number) 




19  ABSTRACT (Continue on reverse if necessary and identify by block number)

Earlier studies (Technical Report No. 3) on the three-dimensional "clam" or cup shapes
formed by two cholic acid molecules linked together by a substituted aromatic spacer had
shown that the resulting cavities were of similar shape but different width, due to the
flexible nature of this host system, with its sensitivity to minor bond rotations. In
the L-arabinose binding protein, the binding pocket shape is cup-like, approximately
12-12.3 Å long and 7.3 Å wide (taken from crystallographic coordinates, Fig. 1). In
order to mimic the sugar binding proteins, wherein the sugar hydroxyl groups are hydrogen
bonded to the ionized amino acids of the protein through the heteroatoms (with or without
an intervening water network, Fig. 2), it was necessary to investigate the dimensions of
the cholic acid host cavities more closely.


20  DISTRIBUTION/ AVAILABILITY  OF  ABSTRACT 
☒ UNCLASSIFIED/UNLIMITED  □ SAME AS RPT.


22a  NAME OF RESPONSIBLE INDIVIDUAL

Harold  E.  Guard 


DD Form 1473, JUN 86


21.  ABSTRACT  SECURITY  CLASSIFICATION 

□ DTIC USERS  Unclassified


22b  TELEPHONE  (Include  Area  Code)  22c  OFFICE  SYMBOL 

(202) 696-4402  ONR


Previous  editions  are  obsolete. 


SECURITY  CLASSIFICATION  OF  THIS  PAGE 


19.  ABSTRACT  -continued- 

The present study extends the earlier report by defining a "standard"
for  the  cholic  acid  host  structure,  one  that  uses  two  dihedral  angles, 
each  CH2  to  the  aromatic  ring,  to  define  the  origin  direction  of  the 
two  arms  (or  halves)  of  the  clam  shapes.  The  definition 
is  applied  to  cholic  acid  host  systems  containing  a  m-xylylene  and  a 
p-xylylene  aromatic  spacer.  Subsequent  manipulation  of  two  critical 
dihedral  angles  in  the  flexible  portion  of  the  hosts  creates  cavities 
which  are  then  evaluated  geometrically  by  measuring  the  interatomic 
distances  between  key  heteroatoms  (hydroxyls)  in  each  arm.  The  90 
and  120  degree  clam  shapes  of  the  m-xylylene  cholic  acid  host  are 
the  closest  approximations  to  the  L-arabinose  binding  protein, 
although  these  shapes  are  not  perfect  "clams"  since  they  have  the  top 
arm  offset  relative  to  the  bottom  arm.  In  the  p-xylylene  cholic  acid 
hosts,  two  90  degree  structures  seem  to  create  good  clam  shapes  in 
which  to  dock  a  monosaccharide  guest.  Geometry  optimization  with 
energy  minimization  of  these  two  hosts  results  in  very  little  change 
in  the  cavity  shapes  but  does  relieve  several  bad  steric  interactions. 
The Cholic Acid Host System Shape

Abstract 


Earlier studies on the three-dimensional "clam" or cup shapes formed by two cholic
acid molecules linked together by a substituted aromatic spacer had shown that the
resulting cavities were of similar shape but different width, due to the flexible nature of
this host system, with its sensitivity to minor bond rotations (see report entitled The
Cholic Acid Host System Shape of 2/25/90). In the L-arabinose binding protein, the
binding pocket shape is cup-like, approximately 12-12.3 Å long and 7.3 Å wide (taken
from crystallographic coordinates, Fig. 1). In order to mimic the sugar binding proteins,
wherein the sugar hydroxyl groups are hydrogen bonded to the ionized amino acids of
the protein through the heteroatoms (with or without an intervening water network,
Fig. 2), it was necessary to investigate the dimensions of the cholic acid host cavities
more closely.

The present study extends the earlier report by defining a "standard" for the cholic
acid host structure, one that uses two dihedral angles, each CH2 to the aromatic ring, to
define the origin direction of the two arms (or halves) of the clam shapes. The definition
is applied to cholic acid host systems containing a m-xylylene and a p-xylylene aromatic
spacer. Subsequent manipulation of two critical dihedral angles in the flexible portion
of the hosts creates cavities which are then evaluated geometrically by measuring the
interatomic distances between key heteroatoms (hydroxyls) in each arm. The 90 and
120 degree clam shapes of the m-xylylene cholic acid host are the closest
approximations to the L-arabinose binding protein, although these shapes are not
perfect "clams" since they have the top arm offset relative to the bottom arm. In the
p-xylylene cholic acid hosts, two 90 degree structures seem to create good clam shapes
in which to dock a monosaccharide guest. Geometry optimization with energy
minimization of these two hosts results in very little change in the cavity shapes but
does relieve several bad steric interactions. The p-xylylene cavities are
approximately 12 Å long and 9-11 Å wide.
Meta-xylylene Aromatic Spacer


The torsional angle values for bonds labeled a, b, c, d, and e were obtained from
crystallographic coordinates (refcode = BUGJES), while f and g were set to create the
reference conformation shown below. While a, c, d, and e remained fixed in both
halves of the cholic acid host containing the meta-xylylene spacer, dihedral angles b
(C17-C20-C22-C23) and g (benzylic C2-Har) were varied. Dihedral angle
b has the most direct effect on the cholic acid arm vector (directionality), whereas
dihedral angle g alleviates the bad steric interactions, found at the reference torsional
angle of 0 degrees, between the hydrogen of the N-H in the amide and the ortho-proton
C2-H of the benzene ring.


Reference

a = +173.6
b = +60.6
c = +179.3
d = -168.7
e = +172.6
f = +180.0
g = 0
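
The reference values above are signed dihedral (torsional) angles over four atoms. As a minimal sketch, assuming Cartesian coordinates in Å and the common IUPAC sign convention (the report does not state which convention its modeling software used), such an angle could be computed as follows. The function name and the test coordinates are illustrative and are not taken from the report.

```python
import math

def sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dihedral(p1, p2, p3, p4):
    """Signed dihedral angle p1-p2-p3-p4 in degrees, in the range [-180, 180]."""
    b1, b2, b3 = sub(p2, p1), sub(p3, p2), sub(p4, p3)
    n1 = cross(b1, b2)  # normal of the plane through p1, p2, p3
    n2 = cross(b2, b3)  # normal of the plane through p2, p3, p4
    y = math.sqrt(dot(b2, b2)) * dot(b1, n2)
    x = dot(n1, n2)
    return math.degrees(math.atan2(y, x))

# Hypothetical four-atom chains (not coordinates from the report).
cis = dihedral((1, 0, 0), (0, 0, 0), (0, 0, 1), (1, 0, 1))
trans = dihedral((1, 0, 0), (0, 0, 0), (0, 0, 1), (-1, 0, 1))
gauche = dihedral((1, 0, 0), (0, 0, 0), (0, 0, 1), (0, 1, 1))
print(cis, trans, gauche)
```

With this convention, an eclipsed arrangement gives 0 degrees (as for the reference value of g) and a fully extended arrangement gives a magnitude of 180 degrees (as for f).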


The "standard" for the cholic acid host is defined as shown: the molecule starts at the
left, proceeding along arm A through the aromatic spacer toward arm B, always with a
negative g dihedral angle in arm A, using C1' and C2'. Dihedral angle g in arm B is
defined using C3' and C2'.


[Figure: definition of dihedral angle g. Arm A: g defined as N-C-C1'-C2' (negative). Arm B: g defined as N-C-C3'-C2'.]


The  following  table  of  dihedral  angle  combinations  thus  results  from  the  manipulation  of 
dihedral  angle  g. 

Dihedral Angles

Name          Bond 20-22 (b)        Bond g to Aromatic Ring
              Arm A     Arm B       Arm A     Arm B

_60 Clam      +60.6     +60.6       -60       +60
_90 Clam      +60.6     +60.6       -90       +90
_120 Clam     +60.6     +60.6       -120      +120
_150 Clam     +60.6     +60.6       -150      +150
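
Setting a dihedral angle to a chosen value, as in the table above, amounts to rotating every atom on one side of a bond about that bond. The following is a minimal sketch of that operation using Rodrigues' rotation formula. It is not the SYBYL procedure used in the report; the function names, the sign convention (IUPAC), and the toy coordinates are assumptions made for illustration.

```python
import math

def sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dihedral(p1, p2, p3, p4):
    """Signed dihedral angle p1-p2-p3-p4 in degrees."""
    b1, b2, b3 = sub(p2, p1), sub(p3, p2), sub(p4, p3)
    n1, n2 = cross(b1, b2), cross(b2, b3)
    return math.degrees(math.atan2(math.sqrt(dot(b2, b2)) * dot(b1, n2), dot(n1, n2)))

def rotate_about_bond(point, p2, p3, angle_deg):
    """Rotate point about the p2->p3 axis (right-hand rule), Rodrigues' formula."""
    axis = sub(p3, p2)
    length = math.sqrt(dot(axis, axis))
    k = (axis[0] / length, axis[1] / length, axis[2] / length)
    v = sub(point, p3)  # position relative to a point on the axis
    t = math.radians(angle_deg)
    c, s = math.cos(t), math.sin(t)
    kxv, kdv = cross(k, v), dot(k, v)
    return tuple(p3[i] + v[i] * c + kxv[i] * s + k[i] * kdv * (1.0 - c)
                 for i in range(3))

def set_dihedral(p1, p2, p3, downstream, target_deg):
    """Rotate all downstream atoms so that p1-p2-p3-downstream[0] equals target_deg."""
    delta = target_deg - dihedral(p1, p2, p3, downstream[0])
    return [rotate_about_bond(atom, p2, p3, delta) for atom in downstream]

# Toy fragment (hypothetical coordinates): the first downstream atom defines the angle.
p1, p2, p3 = (1.0, 0.0, 0.0), (0.0, 0.0, 0.0), (0.0, 0.0, 1.5)
arm = [(1.0, 0.0, 1.5), (1.5, 0.5, 2.5)]
results = {}
for target in (-60.0, -90.0, -120.0, -150.0):  # the arm A values of g in the table
    moved = set_dihedral(p1, p2, p3, arm, target)
    results[target] = (dihedral(p1, p2, p3, moved[0]), math.dist(p3, moved[0]))
    print(target, round(results[target][0], 3))
```

Because the operation is a rigid rotation, bond lengths and angles within the moved arm are unchanged; only the relative orientation of the two sides of the bond differs.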

Each of the four cholic acid hosts above has the top arm A offset relative to the bottom
arm B, thereby creating the effect of hinge movement. Although the general shape is
that of a "C" or clam, the cavities are of different dimensions (Fig. 3). The interatomic
distances (in Å) between the cholic acid hydroxyls in each arm reflect the cavity size,
which differs mainly in width.


Name          Distance (Å), Arm A // B

                      O3       O7       O12

_60 Clam      O3      7.31     7.73     5.18
              O7     10.12    10.09     5.81
              O12     7.31     7.75     3.97

_90 Clam      O3      8.66    10.87     7.51
              O7     11.98    12.54     8.23
              O12    10.64    11.48     7.84

_120 Clam     O3     13.77    15.75    12.11
              O7     15.72    16.21    12.08
              O12    14.85    15.28    11.86

_150 Clam     O3     18.44    19.67    16.34
              O7     19.38    19.38    15.77
              O12    17.97    17.77    14.91

Comparing the above synthetic cavities to the dimensions of the L-arabinose binding
protein, 12 Å long and 7 Å wide, the _60 Clam is clearly too small in the interior. In
addition, this clam shape has its ends (A-rings of the two steroidal arms) overlapping.
The _150 Clam shape clearly has the largest cavity, although this may indeed be too
large for a monosaccharide to maximize its hydrogen bonding interactions with both
steroidal arms. The _90 and _120 Clam structures, however, seem to be reasonable
cavities in which to dock a sugar guest, with the _90 Clam most closely resembling the
cavity in L-arabinose binding protein.
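
The comparison above can be made concrete by summarizing the tabulated hydroxyl distances for each clam shape. The sketch below copies the nine O...O distances per structure from the m-xylylene distance table and reports their range and mean. The choice of minimum, maximum, and mean as summary statistics is an illustrative assumption and is not a criterion used in the report.

```python
# O...O distances (Å) between the hydroxyls of arm A and arm B,
# copied from the m-xylylene distance table.
distances = {
    "_60 Clam":  [7.31, 7.73, 5.18, 10.12, 10.09, 5.81, 7.31, 7.75, 3.97],
    "_90 Clam":  [8.66, 10.87, 7.51, 11.98, 12.54, 8.23, 10.64, 11.48, 7.84],
    "_120 Clam": [13.77, 15.75, 12.11, 15.72, 16.21, 12.08, 14.85, 15.28, 11.86],
    "_150 Clam": [18.44, 19.67, 16.34, 19.38, 19.38, 15.77, 17.97, 17.77, 14.91],
}

def summarize(values):
    """Return (minimum, maximum, mean) of a list of distances."""
    return min(values), max(values), sum(values) / len(values)

summary = {name: summarize(values) for name, values in distances.items()}

for name, (shortest, longest, mean) in summary.items():
    print(f"{name:10s} min {shortest:5.2f}  max {longest:5.2f}  mean {mean:5.2f}")
```

The _90 Clam distances span 7.51 to 12.54 Å, which brackets the quoted protein pocket dimensions, while the _60 Clam includes a contact under 4 Å and the _150 Clam has no distance shorter than 14.91 Å.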

Manipulation of dihedral angles b (C20-C22) and g was done in several structures to
evaluate the effect on the cavity accompanying changes in the two most crucial bonds
of the flexible chain of each cholic acid arm. The following table shows the single
modifications which were made.


Dihedral Angles

Name          Bond 20-22 (b)        Bond g to Aromatic Ring
              Arm A     Arm B       Arm A     Arm B

_90b          -60.6     +60.6       -90       +90
_90bE         -60.6     +60.6       +90       +90
_90E          +60.6     +60.6       +90       +90
_120E         +60.6     +60.6       -120      -120
_150MI*       -60.6     -60.6       +150      -150

*  the  total  mirror  image,  created  by  reflection  of  X-coordinates  through  YZ  plane 


The _90b structure has a negative dihedral angle b and does not create a clam shape, since the
hydroxyls on arm A are directed outward instead of inward (Fig. 4). Structure _90bE,
the extended form of _90b, and structures _90E and _120E, the extended forms of the _90 Clam and
_120 Clam respectively, illustrate that an open conformation (as opposed to a folded
conformation) can be created with a single dihedral angle modification, namely g in arm
A or B (Fig. 5, _90 Clam and _90E). The total mirror image of one conformation, the
_150 Clam, shows the same size cavity but opposite orientation in space (Fig. 6). None
of these alternate conformations will be considered as candidates for docking the sugar
guests.
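
The mirror-image construction (reflection of X-coordinates through the YZ plane) preserves every interatomic distance and reverses the sign of every dihedral angle, which is consistent with the signs listed for _150MI relative to the _150 Clam. A minimal sketch follows, using hypothetical coordinates and an IUPAC-style dihedral function; none of the numbers are taken from the report.

```python
import math

def sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dihedral(p1, p2, p3, p4):
    """Signed dihedral angle p1-p2-p3-p4 in degrees."""
    b1, b2, b3 = sub(p2, p1), sub(p3, p2), sub(p4, p3)
    n1, n2 = cross(b1, b2), cross(b2, b3)
    return math.degrees(math.atan2(math.sqrt(dot(b2, b2)) * dot(b1, n2), dot(n1, n2)))

def mirror_through_yz(atoms):
    """Reflect every atom through the YZ plane by negating its X-coordinate."""
    return [(-x, y, z) for (x, y, z) in atoms]

# Hypothetical five-atom fragment.
atoms = [(1.0, 0.2, -0.3), (0.0, 0.0, 0.0), (0.1, 0.0, 1.5),
         (0.8, 1.1, 2.0), (3.0, -1.0, 4.0)]
mirrored = mirror_through_yz(atoms)

angle_before = dihedral(*atoms[:4])
angle_after = dihedral(*mirrored[:4])
dist_before = math.dist(atoms[0], atoms[4])
dist_after = math.dist(mirrored[0], mirrored[4])
print(round(angle_before, 2), round(angle_after, 2))
print(round(dist_before, 3), round(dist_after, 3))
```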
Para-xylylene Aromatic Spacer


As with the m-xylylene spacer, the torsional angles for bonds labeled a, b, c, d, and e
were obtained from crystallographic coordinates (refcode = CUBSUN for a, modification
of others to mimic BUGJES), while f and g were set to create the reference
conformation shown below. Dihedral angles a, c, d, e, and f remained fixed in both
halves of the cholic acid host, while b (C17-C20-C22-C23) and g (benzylic C-C1-C2-Har)
were varied. Again, dihedral angle b has the most direct effect on the cholic
acid arm vector (directionality), whereas dihedral angle g alleviates the bad steric
interactions, found at the reference torsional angle of 0 degrees, between the hydrogen
of the N-H in the amide and the ortho-proton C2-H of the benzene ring.


Reference

a = -170.3
b = +60.6
c = +179.3
d = +180.0
e = +180.0
f = +180.0
g = 0


The "standard" for this cholic acid host is defined as shown: the molecule starts at the
left and proceeds along arm A through the aromatic spacer toward arm B, always with a
negative g dihedral angle in arm A, using C1' and C2'. This dihedral angle is
comparable to that used in the "standard" for the m-xylylene spacer. Dihedral angle g
in  arm  B  is  measured  using  and  C3'. 


[Figure: definition of dihedral angle g in arms A and B. In arm A, g is defined as N-C-C1'-C2' and is negative.]
The p-xylylene aromatic spacer provides a more symmetrical clam shape due to the
para substitution of the steroidal arms onto the benzene ring. In addition, the N-H to
C2-H interaction is not as severe. Therefore, since the considerations of geometry
were different here, the studies with this host proceeded differently than with the
m-xylylene fragment. Two combinations of dihedral angles b and g giving folded "C"
shapes were studied, along with two extended "S" shapes. The starting structures in
this series were optimized energetically (full geometry, no electrostatics) using the
MAXIMIN2 force field found in the SYBYL Molecular Modeling Software (Appendix A).
The resulting "C" shapes widened in the middle while moving closer together at the
ends, the A rings of the steroidal arms. The "S" shapes tightened by
curving more toward the spacer. The following table shows both the beginning and
final energies and dihedral angles for each of the p-xylylene cholic acid hosts.


Name              Energy        Dihedral Angles for Arm A, Arm B
                                Bond 20-22 (b)        Bond g

_PX               +817          +60.6, +60.6          -90, +90
_PX_MAX (a)       +65.1         +62.6, +62.7          -82.6, +97

_PX_0             +1028         +60.6, 0              -90, +90
_PX_0_MAX         +71.06*       +59.9, -35.4          -73.3, +105

_PX_OPEN (b)      +817          +60.6, +60.6          -90, -90
_PX_OPEN_MAX      +67.16        +63.5, +63.7          -85.5, -85.7

_PX_0E (c)        +1029         +60.6, 0              -90, -90
_PX_0E_MAX        +74.54**      +56.6, -38.0          -79.5, -71.8

*  +0.04 away from convergence at +71.023
** +0.04 away from convergence at +74.40

(a) _MAX = MAXIMIN2 result; all _MAX results were fit to _PX
(b) the "open" or extended form of _PX
(c) the extended form of _PX_0

The  components  of  the  total  energy  are  given  in  Appendix  B.  The  initial  difference 
between  the  two  90  degree  clam  shapes  is  shown  in  Fig.  7.  Initial  and  final  structures 
_PX_0  and  _PX_0_MAX  are  shown  in  Fig.  8,  illustrating  the  typical  change  in  the  “C" 
clam  shape.  The  structure  of  _PX_MAX  is  shown  in  Fig.  9.  Finally,  Fig.  10  shows  how 
minimization  alters  the  “S”  shape  of  an  extended  p-xylylene  cholic  acid  host. 
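
The effect of minimization on the energies can be tabulated directly from the values in the energy table. The sketch below computes the energy decrease for each starting structure and identifies the lowest-energy minimized host. The energies are copied from the table; the report does not state their units in this section, so none are assumed here, and the variable names are illustrative.

```python
# (starting structure, initial energy, minimized structure, final energy),
# copied from the p-xylylene energy table; units are not stated in this section.
runs = [
    ("_PX",      817.0,  "_PX_MAX",      65.1),
    ("_PX_0",    1028.0, "_PX_0_MAX",    71.06),
    ("_PX_OPEN", 817.0,  "_PX_OPEN_MAX", 67.16),
    ("_PX_0E",   1029.0, "_PX_0E_MAX",   74.54),
]

# Energy decrease on minimization for each starting structure.
drops = {start: e_initial - e_final for start, e_initial, _, e_final in runs}

# Minimized structure with the lowest final energy.
lowest = min(runs, key=lambda run: run[3])[2]

for start, e_initial, final, e_final in runs:
    print(f"{start:9s} -> {final:13s} {e_initial:7.1f} -> {e_final:6.2f}"
          f"  (decrease {drops[start]:.2f})")
print("lowest final energy:", lowest)
```

In every case the decrease is more than 90 percent of the starting energy, which is consistent with the statement that minimization relieves bad steric interactions while leaving the dihedral angles within a few tens of degrees of their starting values.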
The atom pair distances indicative of the cavities created in the two "C" shapes (both
before and after minimization) were:


Name            Distance (Å), Arm A // B              Cavity Width (Å)

                        O3       O7       O12

_PX             O3     10.00    12.72     9.34
                O7     12.67    14.72    11.29        8.81
                O12     9.26    11.27     8.81

_PX_MAX         O3      9.98    12.68     9.12
                O7     12.63    14.86    11.41        9.08
                O12     9.04    11.40     9.08

_PX_0           O3      3.01     6.54     8.50
                O7      7.48     8.28    10.57        5.47, 10.57*
                O12     7.78     6.47     9.63

_PX_0_MAX       O3      2.90     6.56     8.32
                O7      7.31     9.20    10.85        6.66, 10.85
                O12     7.78     6.47     9.63

* cavity is larger in the middle than at the ends


The two 90 degree clam shape cavities formed from the p-xylylene aromatic spacer,
with widths of 9-11 Å and lengths of 12 Å, are reasonable starting points for
incorporating the monosaccharide moiety. The energy minimizations did not
significantly alter the cavity sizes but optimized the overall geometry (Fig. 8). The
extended forms of these hosts are created by rotation about a single bond, g in arm B,
from the flexible chain to the aromatic ring. If the molecule exists in an extended form,
for example conformation _PX_OPEN_MAX, the distance from the end of the steroid
arm B to the plane of the xylylene fragment is 12.03 Å (Fig. 11). Therefore, to create a
clam or "C" shape, this arm would have to swing around on the hinge to cover almost
25 Å.
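
A distance from an atom to the plane of the aromatic ring, such as the 12.03 Å quoted above, can be obtained from the plane normal. The sketch below assumes the plane is defined by three non-collinear ring atoms; the report does not say how the plane was constructed in SYBYL. The ring and arm coordinates are hypothetical and were chosen only so that the result reproduces the quoted 12.03 Å.

```python
import math

def sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def distance_to_plane(point, a, b, c):
    """Perpendicular distance from point to the plane through a, b, and c."""
    normal = cross(sub(b, a), sub(c, a))
    return abs(dot(sub(point, a), normal)) / math.sqrt(dot(normal, normal))

# Hypothetical coordinates (Å): three ring carbons lying in the z = 0 plane
# and an arm-end atom placed 12.03 Å above that plane.
ring_a = (1.390, 0.000, 0.0)
ring_b = (-0.695, 1.204, 0.0)
ring_c = (-0.695, -1.204, 0.0)
arm_end = (3.0, 1.0, 12.03)

d = distance_to_plane(arm_end, ring_a, ring_b, ring_c)
print(round(d, 2))
```

For a real ring that is not perfectly planar, a least-squares plane through all six ring atoms would be a more robust choice than three atoms.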

To complete this phase of the cholic acid host system geometry evaluation, energy
minimizations on the m-xylylene structures must be done and the accompanying
change in cavity size and/or shape must be studied. Docking a monosaccharide guest,
n-pentyl beta-D-glucoside, has begun in the p-xylylene cholic acid hosts by first visually
docking the sugar and then optimizing the whole complex using MAXIMIN2. These
studies are preliminary since only the steric (no electrostatic) component of the energy
has been included in the evaluation of the host-guest complexes.
Figure 2. Schematic diagram of the intricate networks of hydrogen bonds formed in the complex of the L-arabinose-binding
protein with the L-arabinose substrate. Shell I represents the essential residues hydrogen-bonded to the sugars and to adjacent
second shell (shell II) of residues. Note especially that Arg-151 is not involved in salt linkages but is the source of five donor
groups for five hydrogen bonds. (Adapted from Quiocho and Vyas 1984.)